Aspects of Pattern-matching in Data-Oriented Parsing
Abstract
Data-Oriented Parsing (DOP) ranks among the best parsing schemes, pairing state-of-the-art parsing accuracy with the psycholinguistic insight that larger chunks of syntactic structure are relevant grammatical and probabilistic units. Parsing with the DOP model, however, seems to involve a lot of CPU cycles and a considerable amount of redundant work, brought on by the concept of multiple derivations, which is necessary for probabilistic processing but lacks a convincing linguistic motivation. It is, however, possible to reinterpret the DOP model as a pattern-matching model that tries to maximize the size of the substructures from which the parse is constructed, rather than the probability of the parse. Emphasizing this memory-based aspect of the DOP model makes it possible to do away with multiple derivations, opening up possibilities for efficient Viterbi-style optimizations while still retaining acceptable parsing accuracy through enhanced context-sensitivity.
Similar resources
Aspects Of Pattern-Matching In Data-Oriented Parsing
Data-Oriented Parsing (DOP) ranks among the best parsing schemes, pairing state-of-the-art parsing accuracy to the psycholinguistic insight that larger chunks of syntactic structures are relevant grammatical and probabilistic units. Parsing with the DOP model, however, seems to involve a lot of CPU cycles and a considerable amount of double work, brought on by the concept of multiple derivation...
Matching Scores of System Relevance and User-Oriented Relevance in SID, ISC and Google Scholar
Background and Aim: The main aim of information storage and retrieval systems is to store and retrieve relevant information, that is, to provide documents that match users' needs or requests. This study aimed to determine how closely system relevance and user-oriented relevance match in the SID, ISC and Google Scholar databases. Method: In this study 15 keywords of ...
Hierarchical Maximum Pattern Matching with Rule Induction Approach for Sentence Parsing
Chinese parsing has been a highly active research area in recent years. This paper describes a hierarchical maximum pattern matching method that integrates a rule induction approach for sentence parsing on the traditional Chinese parsing task. We analyzed and extracted statistical POS (part-of-speech) tagging information from the training corpus, then used the related information for labeling unknown words in...
Parsing for Data Exchange in Coupled MEMS CAD
We present a new approach to handle the data exchange between application programs in performing coupled micro-electro-mechanical system (MEMS) simulation. With existing techniques, input data is extracted via a close interaction between each application program and a parser, which performs pattern matching and possibly executes semantic actions. Such a strong coupling application-parser intera...
An improved joint model: POS tagging and dependency parsing
Dependency parsing is a form of syntactic parsing that automatically analyzes the dependency structure of natural-language sentences, producing a dependency graph for each input sentence. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers perform POS tagging and dependency parsing in a pipeline mode. Unfortunately, in pipel...